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© A disc array system has a plurality of disc 
storage devices (i.e., drives) constituting a parity 
group and mounted in a distributed manner on dif- 
ferent boards. Where any one drive in the same 
parity group develops a fault, the board carrying the 
faulty drive is withdrawn from the mother board for 
repair. If the CPU issues a write request for data 
held in the faulty drive or in any normal data on the 
same board, the requested data is recovered from 
the multiple data retained in the other drives con- 
stituting the same logical group and mounted on the 
other boards. The recovered data is sent to the CPU. 

If the CPU supplies write data to the faulty drive 
or to any normal drive on the same board, the 
supplied write data is held temporarily in a cache 
memory in an array controller. Later, when the board 
is repaired and replaced onto the mother board, the 
data to be held in the alternate drive having replaced 
the faulty drive is recovered from the data in the 
other drives, and the recovered data is written to that 
alternate drive. Then the write data in the cache 
memory is written to the alternate drive or to the 
target normal drive. 
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BACKGROUND OF THE INVENTION 

The present invention relates to a disc array 
system comprising a plurality of disc storage de- 
vices that retain a plurality of data together with 
error correction codes. 

In existing computer systems, the data needed 
by the CPU or by the similar high-order facility is 
hold in a secondary storage device. When neces- 
sary, the CPU or its equivalent accesses the sec- 
ondary storage device to write or read data thereto 
o* therefrom. The secondary storage device gen- 
c rally utilizes a nonvolatile storage medium and 
t.i-caUy constitutes a magnetic disc unit or an 

<a; c$*tK orvt Such disc units will each be 
' i-o *i ^ i«c drive hereunder. 

WiT?i K-u."- society getting increasingly in- 
* rrnatu.'o-orN-ntt'i computer systems require sec- 
■■*»-t.«ry st xa novices of correspondingly en- 
rw*rv »<i poriLwmancc; One proposed solution to this 
ffwiiimrp- ; , nw array setup composed of nu- 
rr*'ir>u* cii^c rtfivi?? of a relatively small capacity 
fjcirh 

The solution is discussed illustratively by D. 
Patterson. G Goson and R. H. Kartz, "A Case for 
Rodundant Arrays of Inexpensive Discs (RAID)," 
ACM SIGMD Conference, Chicago, IL, (June 1988), 
pp. 109-1 16 (nereinafter referred to as the first prior 
art). The paper reports on the performance and 
reliability of disc arrays (at level 3) for dividing data 
and parallelly processing the data divisions and of 
disc arrays (at levels 4 and 5) for distributing and 
independently handling data. 

The disc arrays at level 3 divide one write data 
item transferred from the CPU in connection with a 
write request, and generate parity data for error 
correction based on the divided data. The data 
divisions and the parity data are stored parallelly in 
a plurality of disc drives. When the CPU orders a 
read operation of the divided data, the data di- 
visions are read parallelly from the appropriate disc 
drives and are combined for transfer to the CPU. 

The disc arrays at levels 4 and 5 differ from 
those at level 3 in the following two aspects: that 
the divided data at level 3 are replaced by a 
plurality of data items that are used in connection 
with a plurality of write requests transferred from 
the CPU; and that the data are read in units of data 
items and, unlike at level 3, are not combined for 
transfer to the CPU. 

The multiple data items and the error correc- 
tion data generated from such data items are called 
an error correction data group, a parity group or a 
parity data group. Because the error correction 
data may be constituted not only by parity data but 
also by various other kinds of error correction data, 
the wording "parity group" is used in this specifica- 
tion even where the error correction data may not 
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be parity data. Although the error correction codes 
in this specification are composed of parity data, 
this invention may also be applied to cases where 
the error correction codes are something other than 

5 the parity data. 

In the description that follows, a plurality of 
disc drives accommodating the data of the same 
parity group are collectively called a logical group. 
The logical group constitutes a fault recovery unit. 

w If a disc drive of a logical group becomes faulty, 
the fault can be addressed using the data in the 
other disc drives within the same logical group. 
Illustratively, if a disc drive holding divided data 
has developed a fault and cannot be accessed for 

75 reading data therefrom, the data in the faulty disc 
drive can be reconstructed by use of the data and 
parity data held in the remaining disc drives of the 
same logical group. In systems comprising nu- 
merous disc drives such as disc array systems, 

20 growing numbers of component parts tend to in- 
crease the probability of fault. It is for the purpose 
of enhancing the reliability of disc drives that the 
parity scheme is employed for recovery from fault. 
Also proposed is a scheme whereby all data 

25 destined to a faulty drive is written to a spare drive 
until the faulty drive is replaced by an alternate 
drive. One such scheme is described in U.S. Pat- 
ent No. 5,077,736 (called the second prior art here- 
under). 

30 With conventional disc array systems of level 3 

(called the third prior art hereunder), a plurality of 
disc drives are connected by cables to an array 
controller. In this setup, the drives are connected to 
the array controller using as many buses as the 

35 number of drives constituting each different logical 
group. That is, the first drive of each of the dif- 
ferent logical groups is connected to the array 
controller via a common first bus, by use of cables 
such that the first drives make up a daisy chain 

40 when connected. The second drive of each dif- 
ferent logical group is connected likewise to the 
array controller via a common second bus, using 
cables such that the second drives also constitute 
a daisy chain. The other drives are connected in 

45 like manner. Where numerous drives make up the 
disc array system, the cable-based drive connec- 
tion is not very desirable because of floor space 
constraints and of reduced maintainability. What is 
desired then is a setup having many small drives 

so installed therein. 

Japanese Patent Laid-open No. Hei 3-108178 
(corresponding to U.S. Patent Application No. 
409495, filed on September 19, 1989) discloses a 
technique whereby a plurality of drives each having 

55 numerous pins at the bottom are mounted on a 
semiconductor board (the technique may be called 
the fourth prior art hereunder). The disclosed tech- 
nique is aimed at mounting numerous small drives 
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on the board. However, it is not disclosed whether 
or how these drives constitute logical groups. 

Japanese Patent Laid-open No. Hei 4-228153 
(corresponding to U.S. Patent Application No. 
502215, filed on March 30, 1990) discloses a tech- 
nique whereby a plurality of disc drives are moun- 
ted detachably on each of a plurality of boards 
having common wiring, the drives on one board 
being called a disc array (the technique may be 
called the fifth prior art hereunder). The multiple 
disc drives of each array belong to the same 
logical group. If any one drive on a given board 
develops a fault, the board carrying the faulty drive 
remains inserted in the system and the faulty drive 
is withdrawn from the board to be replaced by an 
alternate drive. In this case, the faulty drive re- 
mains accessible. Specifically, if a read request is 
generated to read data from the faulty drive, the 
data and parity data held in the other disc drives of 
the same logical group are used to provide the 
target data to be read. 

SUMMARY OF THE INVENTION 

With the fifth prior art outlined above, the ar- 
rays each comprising a plurality of disc drives are 
mounted on a common mother board so that nu- 
merous small drives are installed at high density. 
However, if any one drive develops a fault, the 
faulty drive must be withdrawn from the applicable 
board for replacement by an alternate drive. To 
withdraw any faulty drive requires providing a 
space wide enough for a user to gain manual 
access to that drive. That space requirement poses 
limits to arranging numerous drive-carrying boards 
at short intervals. 

It is therefore an object of the present invention 
to provide a disc array system having a plurality of 
disc drives mounted on a plurality of boards and 
getting those boards arranged on a common moth- 
er board at relatively short intervals wherein, if a 
fault occurs in any disc drive, the data held in the 
faulty drive may still be accessed by an upper 
device. 

In carrying out the invention and according to 
one aspect thereof, there is provided a disc array 
system comprising: a mother board; a plurality of 
boards mounted detachably on one side of the 
mother board and arranged substantially in parallel; 
a plurality of disc drives mounted on one side of 
each of the boards; and an array controller for 
controlling the writing and reading of data to and 
from disc drives on the boards via signal lines 
installed on the mother board; the array controller 
including: a write circuit for writing a plurality of 
data items and error correction codes used in 
connection with these data items to a plurality of 
disc drives each mounted on a different board; a 



read circuit used when a board carrying a faulty 
disc drive is withdrawn from the mother board and 
when an upper device accesses any disc drive on 
the withdrawn board for reading a target data item 

s therefrom, the accessed drive being either the faul- 
ty disc drive or a normal drive, the read circuit 
reading a plurality of data items belonging to the 
same error correction data group as that of the 
target data item together with the parity data in 

70 connection with the read data items, from a plural- 
ity of disc drives mounted on a plurality of boards 
other than the withdrawn board; and a recovery 
circuit for recovering the target data item from the 
read data items and from the parity data read in 

15 connection therewith. 

In a preferred structure according to the inven- 
tion, the disc array system further comprises a 
random access memory for temporarily accom- 
modating write data supplied by the upper device 

20 for a write operation to any one of the disc drives 
mounted on the withdrawn board carrying the faulty 
disc drive; and a write circuit used when the board 
having an alternate drive mounted thereon for re- 
placing the faulty disc drive and readied for data 

25 write operations is again mounted on the mother 
board, the write circuit writing to the alternate drive 
the data destined to the faulty disc drive but held in 
the random access memory, the write circuit also 
writing to any normal drive on the replaced board 

30 the data destined thereto but held in the random 
access memory, while the board is being with- 
drawn from the mother board for replacement of 
the faulty disc drive. 

In another preferred structure according to the 

35 invention, the disc array system further comprises: 
at least as many spare drives as the number of 
data-holding disc drives mounted on a single board 
separate from the boards carrying the disc drives 
for holding data; a write circuit used, when the 

40 board carrying the faulty disc drive is withdrawn 
from the mother board, for selecting one spare 
drive with respect to each of the disc drives moun- 
ted on the withdrawn board, the write circuit being 
further used, when the upper device supplies data 

45 to be written to any target disc drive mounted on 
the withdrawn board, for writing the supplied data 
to the spare drive selected with respect to the 
target disc drive; and a write circuit used when the 
board having an alternate drive mounted thereon 

so for replacing the faulty disc drive and readied for 
data write operations is again mounted on the 
mother board, the write circuit writing the data 
destined to the faulty disc drive to the spare drive 
selected with respect to the alternate drive. 

55 Other objects, features and advantages of the 

present invention will become apparent in the fol- 
lowing specification and accompanying drawings. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a schematic view of a disc array 
system practiced as a first embodiment of the 
invention; 

Fig. 2A is a view showing the arrangement of 
boards in the first embodiment of Fig. 1; 
Fig. 2B is a view depicting connectors mounted 
on the mother board of the first embodiment; 
Fig. 3 is a detailed circuit diagram of circuits in 
the first embodiment; 

Fig. 4 is a view of a typical logical group table 
for use with the circuits of Fig. 3; 
Fig. 5 is a view of a typical address table for use 
with the circuits of Fig. 3; 

Fig. 6 is a view illustrating how the first embodi- 
ment writes data; 

Fig. 7 is a view of a typical logical group table 

used by a disc array system practiced as a 

second embodiment of the invention; 

Fig. 8 is a view of a typical address table for use 

with the second embodiment; and 

Fig. 9 is a view of a typical address table used 

by a disc array system practiced as a third 

embodiment of the invention. 

DESCRIPTION OF THE PREFERRED EMBODI- 
MENTS 

(First embodiment) 

(1) Outline of the system 

Fig. 1 is a schematic view of a disc array 
system practiced as the first embodiment of the 
invention. In Fig. 1, the disc array system com- 
prises a plurality of disc drives 5 and an array 
controller 2 for controlling the operation of these 
disc drives. 

Reference numeral 1 represents a CPU that 
orders the writing and reading of data externally. 
The CPU 1 and the array controller 2 are con- 
nected by a plurality of channel paths 7. The array 
controller 2 includes: a plurality of (e.g., 4) channel 
path director boards 8A carrying distributed chan- 
nel path directors 8 for selecting channel paths 7; a 
plurality of (e.g., 4) channel path interface boards 
3A carrying channel interface circuits 3 for control- 
ling data transfer to and from the CPU 1; a plurality 
of (e.g., 4) cache memory boards 4A carrying 
distributed cache memories 4 which are nonvolatile 
semiconductor memories backed up by batteries; 
and a bus 37 for connecting the cache memory 
boards 4A with the channel interface boards 3A. 
The bus 37 is also connected to a plurality of (e.g., 
4) drive boards 5A each retaining a plurality of disc 
drives 5 and a drive interface circuit 20. 
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Fig. 2A illustrates the arrangement of boards in 
the first embodiment of Fig. 1. As shown, the 
boards described above are mounted on a com- 
mon mother board 37. In Fig. 2A, rectangles 370 
5 indicated on each board represent an LSI chip 
each. It should be noted that only a part of the 
rectangles 370 on the drive board 5A are LSI chips; 
the remaining majority 5 are the disc drives moun- 
ted on that board. The mother board 37 comprises 

w board connector groups 30A, 40A, 50A and 80A 
indicated in Fig. 2B. It is by means of these con- 
nectors that the boards are inserted detachably in 
the mother board. The boards are arranged sub- 
stantially in parallel with one another. 

75 The surface of the mother board supports the 

bus 37 for connecting the channel interface con- 
nector group 30A, the cache memory connector 
group 40A, and the drive board connector group 
50A. Also provided are signal lines 12 (Fig. 3) for 

20 connecting the channel director connector group 
80A with the channel interface connector group 
30A. 

One characteristic of the first embodiment lies 
in the fact that, when a fault has occurred in a disc 

25 drive on a drive board, that drive board together 
with its normal drives is withdrawn from the mother 
board 37 (i.e., the faulty drive not detached from 
the drive board) for replacing the faulty drive with 
an alternate drive, the drive board along with the 

30 alternate drive being replaced onto the mother 
board 37. The point is that if any disc drive has 
developed a fault, the drive board carrying the 
faulty drive need only be withdrawn from the moth- 
er board and inserted therein again after replace- 

35 ment of the faulty drive. This makes it possible to 
reduce the spacing between drive boards when 
they are mounted. Another characteristic of the first 
embodiment is that a plurality of disc drives moun- 
ted in a distributed manner on a plurality of drive 

40 boards constitute a logical group. Thus when any 
drive board is withdrawn from the mother board, 
the data held in any of the disc drives on the 
detached board may be recovered from the data 
and parity data held in the other drives mounted on 

45 the other drive boards within the same logical 
group. 

The logical group 9 is made up of m drives 5 
(illustratively four drives in the first embodiment) 
mounted on a plurality of (e.g., 4) boards. The 

50 logical group 9 is a fault recovery unit comprising 
the disc drives 5 that retain m-l error correction 
data groups (i.e., parity groups) composed of data 
and the parity data derived therefrom. 

Although the first embodiment comprises four 

55 channel interface boards 3 and four drive interface 
boards 6, the numbers of these components are 
not limitative of the invention. Likewise, the number 
of disc drives 5 mounted on each board and the 

4 
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number of disc drives 5 constituting one logical 
group 9 are not limitative of the invention; these 
counts may be selected as desired. 

Fig. 3 is a detailed circuit diagram of each 
board in the first embodiment The channel path 
directors 8 include interface adapters (IF Adp) 10 
connected to the respective channel paths 7 and a 
channel path switch 11. Although the channel path 
directors 8 are actually mounted in a distributed 
manner on four boards, Fig. 3 provides a com- 
prehensive diagram of the entire circuitry constitut- 
ing the directors 8. 

The channel interface circuits 3 each comprise: 
a channel interface (CH IF) 13 for effecting protocol 
conversion and transfer rate adjustment regarding 
the data transferred from the CPU 1 ; a data control 
circuit (DCC) 14 for controlling data transfer; a 
channel-side cache adapter (C Adp) 1 5 for control- 
ling the writing of data sent from the CPU 1 into 
the cache memory 4; a parity generator 18 for 
generating parity data from the divided data written 
to the cache memory 4; and a microprocessor 
(MP) 17 for controlling the configured circuits. The 
channel interfaces 13 are connected to the channel 
path switch 11 via data lines 12. The cache adapt- 
ers 15 control the bus 37 as well as the transfer of 
data between the CPU 1 and the cache memories 
4. 

The drive board 5A includes a drive-side cache 
adapter (C Adp) 19 and a drive interface circuit 
(Drive IF) 20. Reference numeral 16 represents 
control signal lines that connect the channel path 
switch 11, the channel interfaces 13, the data con- 
trol circuits 14, the channel-side cache adapters 15, 
the microprocessors 17, the cache memories 4, the 
drive-side cache adapters 19 and the drive inter- 
face circuits 20. The bus 37 comprises a control 
bus 37A for transferring control information and a 
data bus 37B for transferring data. 

In Figs. 1 and 3, the board 50A is for use with 
the second embodiment to be described later. That 
is, the board 50A carries spare drives 5. 

(2) Outline of the system operation in the absence 
of faulty disc drives 

The internal workings of the first embodiment 
will now be described with reference to Fig. 3. The 
first embodiment works primarily on the so-called 
RAID 3 control principles. 

The CPU 1 first issues a read or write com- 
mand. The command is input to a channel path 
director 8 in the array controller 2 via the channel 
paths 7. 

Given the command from the CPU 1, the chan- 
nel path director 8 in the array controller 2 checks 
to see if the command is acceptable. When the 
command is found to be acceptable, that command 



sent from the CPU 1 to the array controller 2 is 
input to the interface adapter (IF Adp) 10. The 
microprocessor (MP) 17 performs command ac- 
ceptance processing. The first embodiment writes 

5 and reads data primarily in accordance with the 
conventional RAID level 3 scheme. 

Fig. 6 depicts how the first embodiment writes 
data as per the RAID level 3 scheme. In Fig. 6, 
reference characters BADRi (i = 1, 2, 3 or 4) 

io represent the board addresses of four drive boards 
5A. Reference characters SD#1 (i = 1, 2, 3 or 4) 
denote the drive address specifying one disc drive 
on each of the boards. 

In the description that follows, the board whose 

75 board address is, say, BADR1 is called the board 
BADR1. Likewise the drive whose drive address is 
SD#1 on a board is called the drive SD#1. 

The drives SD#1 on the boards BADR1 
through BADR4 are assumed to constitute a logical 

20 group. It is also assumed that the disc drives on 
the board BADR4 are used to store parity data and 
that the disc drives SD#1 on the other boards 
BADR1 through BADR3 are used to accommodate 
data. 

25 It is further assumed that the command from 

the CPU 1 is a data write command and that the 
write data sent from the CPU 1 is represented by 
D#1 in Fig. 6. D#1 is a logical address that speci- 
fies the write data. The data specified by the logi- 

30 cal address D#1 will be called the data D#1 here- 
under. 

With the first embodiment, the length of the 
data written or read by the CPU 1 is assumed to 
be fixed at 4 KB. Thus the write data D#1 from the 

35 CPU 1 is 4 KB long. It should be noted that the 
data length is not limitative of the invention; any 
other appropriate data length may be adopted for 
the invention. 

The write data D#1 from the CPU 1 is held in 

40 the cache memories 4 in Fig. 1. Under the RAID 3 
scheme, the write data D#1 thus accommodated is 
divided into single-byte data units from the begin- 
ning of the data, and the data units are distributed 
to the three data storage drives (SD#1 on BADR1, 

45 BADR2 and BADR3) out of the four constituting the 
logical group. The divided data units are called sub 
data having sub data names D#1-1, D#1-2 and 
D#l-3. The sub data whose sub data name is D#1- 
1 will be called the sub data D#1-1 hereunder. 

so Parity data is generated from the sub data follow- 
ing the division. Parity data generation involves 
processing the corresponding bytes of the sub 
data. Specifically, the first byte of the sub data 
D#1-1, the first byte of the sub data D#1-2 and the 

55 first byte of the sub data D#1-3 are computed to 
give a parity data item; the second byte of the sub 
data D#1-1, the second byte of the sub data D#1-2 
and the second byte of the sub data D#1-3 are 
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computed to furnish another parity data item; and 
so on. The parity data items thus generated are 
arranged consecutively to constitute parity data 
P#1. 

The sub data D#1-1, D#1-2, D#1-3 and the 
parity data P#1 are written in a distributed manner 
to the four disc drives making up the logical group. 
The data from the CPU 1 is written in the above- 
described manner. 

When the CPU 1 issues a data read request, 
the logical group holding the requested data is 
accessed for reading therefrom the sub data con- 
stituting the target data. These sub data are com- 
bined and transmitted via the cache memories 4 to 
the CPU 1 . 

(3) Address translation table 

The address translation table necessary for the 
above-described writing and reading of data will 
now be described in detail. 

For the disc drives 5 constituting the disc array 
system, the first embodiment adopts SCSI interface 
drives. In addition, the CPU 1 of the first embodi- 
ment issues write and read commands without 
getting aware of disc arrays. That is, the CPU 1 is 
not aware whether the disc drives to which to issue 
commands are of conventional type or are part of a 
disc array system. This is because the disc drives 
widely used today are of conventional type so that 
the operating systems (OS) are not designed to be 
aware of disc arrays. Thus with the first embodi- 
ment, the CPU 1 operates on the assumption that it 
writes and reads data to and from conventionally 
configured disc drives; the disc array system car- 
ries out its own processing associated with disc 
arrays. For this reason, the CPU 1 issues write and 
read commands using a conventional interface ar- 
rangement. 

More specifically, when the CPU 1 sends a 
data write command to the array controller 2, the 
CPU 1 transfers the target write data together with 
its logical address (data name or data number). 
The array controller 2 divides the write data into 
sub data as described with reference to Fig. 6, and 
stores these sub data as well as the associated 
parity data to the disc drives of the logical group 9. 
At this time, the sub data and parity data are given 
addresses (D#1-1 through D#1-3, P#1, etc. as ex- 
plained by referring to Fig. 6) for processing by the 
array controller 2. The logical address furnished by 
the CPU 1 and the addresses prepared by the 
array controller 2 are matched in what is known as 
the address translation table, to be described be- 
low. 

When the CPU 1 issues a data read command 
to the array controller 2, the CPU 1 furnishes the 
array controller 1 with the applicable logical ad- 
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dress for a read operation by the latter. Using the 
address translation table, to be described below, 
the array controller 2 translates the furnished logi- 
cal address into appropriate addresses used within 
5 the disc array system. These addresses are re- 
ferenced in order to read the sub data which are 
then combined and transferred to the CPU 1. 

The address translation table is constituted by 
a logical group table 21 and an address table 31 . 

io Fig. 4 illustrates a typical structure of the logi- 

cal group table 21. Logical addresses 22 are each 
an address designated by the CPU 1 (i.e., data 
name or data number). Sub data names 24 are 
acquired by dividing data having a given logical 

15 address 22. Parity data names 25 are prepared on 
the basis of the corresponding sub data. Logical 
group numbers 23 are each the number of a logical 
group 9 that currently stores or is expected to store 
the sub data and parity data. 

20 In an initial state of the address translation 

table in which no data is written, the column of 
logical group numbers 23 has predetermined logi- 
cal group numbers set therein as initial values. In 
the initial state, the columns of logical addresses 

25 22, of sub data names 24 and of parity data names 
25 are all left blank in the table. 

Fig. 5 depicts a typical structure of the address 
table 31. The address table 31 retains detailed 
address information about what data has been (or 

30 is to be) written in which location of which drive 
belonging to which logical group. 

Logical addresses 22 in the address table 31 of 
Fig. 5 are the same as the logical addresses 22 in 
the logical group table 21 of Fig. 4. These are 

35 logical addresses (data names or data numbers) 
each designated by the CPU 1. Logical group 
numbers 23 in the address table 31 of Fig. 5 are 
the same as the logical group numbers 23 in the 
logical group table 21 of Fig. 4. Reference numeral 

40 30 represents SCSI drive address information 
about the SCSI drive to which data of the cor- 
responding logical addresses has been (or is to 
be) written. 

The SCSI drive address information 30 com- 
45 prises four columns of address information. Each 
column includes a board address 27, drive num- 
bers 28, fault flags 29 and a board withdrawal flag 
100. The board withdrawal flag 100 indicates 
whether or not the board having the listed address 
50 is withdrawn from the mother board. Reference 
numeral 32 denotes inter-drive addresses each in- 
dicating what data has been (or is to be) written in 
which location of which drive. A board address 27, 
a drive number 28 and an inter-drive address 32 
55 corresponding to a given logical address 22 com- 
bine to denote the location to which the data at that 
logical address 22 (sub data and parity data) has 
been (or is to be) written in which drive on which 

6 
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board. 

The board withdrawal flag 10 for each board is 
set to 1 when that board is withdrawn from the 
mother board 37. When withdrawing a board, the 
user gets the CPU 1 to issue to the array controller 
2 a command indicating that the board has been 
withdrawn. Responding to the issued command, 
the microprocessor 17 sets the board withdrawal 
flag 10 corresponding to the indicated board. Later, 
when the detached board is replaced onto the 
mother board, the user also gets the CPU1 to 
notify the array controller 2 using a command that 
the board in question has been replaced. In re- 
sponse, the array controller 2 performs recovery 
processing, to be described later, and resets the 
board withdrawal flag 100 corresponding to the 
replaced board. 

The fault flag 29 indicates whether or not the 
corresponding disc drive has developed a fault. 
That is, a fault flag 29 remains off (at 0) when 
everything is normal, and is turned on (i.e., set to 
1) if the corresponding disc drive on the corre- 
sponding board has developed a fault and is in- 
accessible for read or write operations thereto. 

The four columns under the SCSI drive ad- 
dress information 30 correspond respectively to the 
columns under the sub data name 24 and to the 
column under the parity data name 25 in the logi- 
cal group table 21 of Fig. 4. 

Because the first embodiment has its data 
stored in parallel in a plurality of regions having the 
same inter-drive address 32, it is preferable to 
synchronize the revolutions of all disc drives 5 
within the same logical group 9. 

As described, the first embodiment reads and 
writes data using the address table 31 and logical 
group table 21. 

(4) System operation upon disc drive fault 

With the first embodiment, as shown in Fig. 6, 
each logical group 9 comprises a plurality of disc 
drives mounted on a plurality of boards (e.g., 
BADR1, BADR2, BADR3, BADR4). The address 
translation tables depicted in Figs. 4 and 5 indicate 
which disc drives on which boards constitute what 
logical group, the indication being given by having 
the address of each disc drive supplemented with 
the applicable board address. 

Suppose that the first embodiment attempts a 
number of times to access a disc drive on, say, the 
board BADR 1 for writing or reading data thereto or 
therefrom. If a predetermined number of retries 
turn out to be unsuccessful, the microprocessor 17 
recognizes a fault of the board BADR1 and, in the 
address table 15, turns on (i.e., sets to 1) the fault 
flag 29 corresponding to the disc drive SD#1 on 
the board BADR1. With the disc drive SD#1 in 



defective state, the disc array system remains 
operational even if the mother board 37 still retains 
the board BADR1 carrying that faulty drive or if the 
defective board is detached from the mother board 

5 37 for repair. 

Specifically, when the CPU 1 issues a data 
read request pertaining to data held in any of the 
disc drives SD#1 on the boards BADR1 - BADR4, 
the microprocessor 17 checks the address table 

10 15. In the manner described in connection with the 
address translation table, the microprocessor 17 
translates the logical address designated by the 
CPU 1 into the address applicable within the disc 
array system. At this point, the microprocessor 17 

75 checks two flags: the fault flag 29 corresponding to 
the disc drive SD#1 on the board (BADR1 in the 
above example) to which the translated address is 
assigned, and the board withdrawal flag 100 cor- 
responding to the board carrying that disc drive. If 

20 the fault flag 29 regarding the disc drive SD#1 is 
found to be on (i.e., set to 1) and if the board 
withdrawal flag 100 applicable to the board BADR1 
carrying that disc drive is off (set to 0), the micro- 
processor 17 recognizes two things: that the disc 

25 drive in question has developed a fault, and that 
the board BADR1 carrying that drive is still held 
inserted in the mother board 37. 

Having recognized the faulty disc drive in the 
above manner, the microprocessor 17 recovers the 

30 data in the disc drive SD#1 on the board BADR1 
through the use of the data and parity data asso- 
ciated with the logical group to which the drive in 
question belongs. Specifically, the microprocessor 
17 retrieves a plurality of sub data and parity data 

35 from the drives SD#1 on the other boards BADR2 - 
BADR4. Using the retrieved sub data and parity 
data, the microprocessor 17 gets the internal parity 
generator 18 to recover the sub data in the faulty 
disc drive SD#1. The recovered sub data is com- 

40 bined with the multiple sub data retrieved earlier so 
as to generate the requested data which is then 
sent to the CPU 1 . It is by a known technique that 
the parity generator 18 recovers the data in the 
faulty disc drive SD#1 . 

45 As described, the parity generator (PG) 18 

generates parity data from sub data when target 
data is written to disc drives. When the sub data is 
recovered from both sub data and parity data, on 
the other hand, the parity generator 18 acts as a 

50 data recovery circuit. 

If the board withdrawal flag 100 corresponding 
to the board BADR1 carrying the disc drive SD#1 
is found to be on (set to 1), the microprocessor 17 
recognizes that the board BADR1 has been with- 

55 drawn from the mother board 37. In that case, the 
microprocessor 17 is unable to read data from all 
disc drives, with or without fault, mounted on the 
board BADR1. Thus in the same manner as that of 
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handling the faulty drive SD#1 described above, 
the microprocessor 17 recovers data in the un- 
successfully accessed drive from the data in the 
other disc drives belonging to the same logical 
group. 

If the CPU 1 issues a data write request for 
updating data in the disc drive SD#1, the micropro- 
cessor 17 checks the address table as in the case 
of the read request. Following the procedure dis- 
cussed in connection with the address translation 
table, the microprocessor 17 translates the logical 
address designated by the CPU 1 into the address 
applicable within the disc array system. At this 
point, the microprocessor 17 checks the fault flag 
29 corresponding to the disc drive in question and 
the board withdrawal flag 100 applicable to the 
board carrying that disc drive. 

If the fault flag 29 corresponding to the disc 
drive SD#1 is found to be on (set to 1) and if the 
board withdrawal flag 100 applicable to the board 
BADR1 carrying the disc drive SD#1 is off (set to 
0), the microprocessor 17 recognizes that the disc 
drive SD#1 has developed a fault and that the 
board BADR1 carrying that drive SD#1 stays in- 
serted in the mother board 37. In that case, the 
microprocessor 17 accommodates the write data 
inside the cache memories 4. 

Later, when the disc drive SD#1 on the de- 
tached board BADR1 is repaired and the board 
BADR1 is replaced onto the mother board 37, the 
microprocessor 17 reads the sub data and parity 
data belonging to all error correction data groups of 
the other disc drives that belong to the same 
logical group as that of the normal alternate disc 
drive replacing the faulty drive (drive SD#1 on the 
board BADR1 in this example). Using the sub data 
and parity data thus read, the microprocessor 17 
recovers all data in the faulty disc drive. The recov- 
ered data is stored in the normal alternate disc 
drive replacing the faulty drive. 

Thereafter, the multiple write data held in the 
cache memories 4 are written to the alternate disc 
drive or to the other appropriate normal drives on 
the board BADR1. The writing of these write data is 
carried out under the RAID 3 scheme. That is, each 
write data item is divided into a plurality of sub 
data from which parity data is generated. One of 
the plurality of sub data is written to one disc drive 
on the board; the other sub data and the parity 
data generated above are written to a plurality of 
disc drives on the other boards. 

When the detached board BADR1 is replaced 
onto the mother board 37, the above-described 
scheme recovers all data in the normal drives on 
the replaced board BADR1. The scheme also re- 
covers the data for which no write request was 
made while the board BADR1 was withdrawn from 
the mother board 37. This scheme is effective 
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where most of the data in the normal drives has 
been updated in response to numerous write re- 
quests issued while the board BADR1 was de- 
tached. However, the scheme takes time need- 
5 lessly for recovery processing if it recovers the 
write data already held in the normal drives for 
which few write requests were issued while the 
board BADR1 was withdrawn from the mother 
board 37. 

w This flaw is circumvented by not recovering the 

write data already held in the normal drive for 
which only a small number of write requests were 
issued while the board BADR1 was detached. In- 
stead, only the data designated by the write re- 
ts quests from the upper device with the board de- 
tached are recovered. 

If the board withdrawal flag 100 regarding the 
board BADR1 carrying the disc drive SD#1 is 
found to be on (set to 1), the microprocessor 17 
20 recognizes that the board BADR1 has been with- 
drawn from the mother board 37. In that case, the 
microprocessor 17 is unable to write data to all 
drives, with or without fault, on the board BADR1 . 
Thus in the same manner as that of handling the 
25 faulty drive SD#1 described above, the micropro- 
cessor 17 places in the cache memories 4 the data 
to be written to that drive SD#1 . 

Later, when the disc drive SD#1 on the board 
BADR1 is replaced by a normal alternate drive and 
30 the board BADR1 is mounted back onto the mother 
board 37, the microprocessor 17 reads the sub 
data and parity data belonging to all error correc- 
tion data groups of the other disc drives that be- 
long to the same logical group as that of the 
35 normal alternate disc drive replacing the faulty 
drive. Using the sub data and parity data thus read, 
the microprocessor 17 recovers all data in the 
faulty disc drive and writes the recovered data to 
the alternate disc drive. Thereafter, the write data 
40 held in the cache memories 4 is written to its 
appropriate destination. 

With the first embodiment, it is impossible to 
access all normal drives on the board BADR1 as 
long as that board is detached from the mother 
45 board 37. While the board BADR1 carrying the 
faulty drive remains withdrawn from the mother 
board, it is necessary to write or read data to or 
from any of these normal disc drives in the same 
manner as with the faulty drive. However, when a 
so data write request is issued to one of the normal 
drives on the detached board and the board is later 
replaced onto the mother board 37, it is not neces- 
sary to recover the data in the target normal drive 
as has been the case with the normal drive having 
55 replaced the faulty drive. This is because the target 
normal drive not having replaced the faulty one 
keeps intact its data held before the board was 
withdrawn. 

8 
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Because the newly generated write data is 
retained in the cache memories 4, the write data 
generated while the board was being detached 
from the mother board 37 is easily written to any 
normal disc drive on the replaced board. 

Because the cache memories 4 hold the write 
data, a read request that may be issued later by 
the CPU 1 to read the retained data causes them 
to be read from the cache memories 4 for direct 
transfer to the CPU 1 . This effectively amounts to 
increasing the data read speed. 

As described, where a drive board carrying a 
faulty disc drive is withdrawn from the mother 
board, the first embodiment can continuously han- 
dle data read and write requests destined to the 
faulty disc drive. 

(First variation of the first embodiment) 

Where a board carrying a faulty disc drive (i.e., 
SD#1) is detached from the mother board, a data 
write request issued by the CPU 1 to write data to 
the faulty drive SD#1 (or to any other normal disc 
drive on the same board) may be processed alter- 
natively as follows: 

The write data is divided into sub data and 
parity data is generated in conjunction therewith, in 
the same manner as when the board BADR1 is still 
attached to the mother board. When the board 
BADR1 is withdrawn from the mother board, the 
sub data or the newly generated parity data is 
written to the disc drives SD#1 or to other drives 
on the boards BADR2 - BADR4 other than the 
board BADR1 carrying the faulty drive SD#1 (or 
other normal drives); the data-receiving disc drives 
here constitute the logical group to which the faulty 
drive SD#1 (or other normal drives) on the board 
BADR1 belongs. The sub data to be written to the 
drive SD#1 or to other normal drives on the de- 
tached board BADR1 is discarded. 

Later, when the board BADR1 is again inserted 
in the mother board 37, data is recovered for all 
drives mounted on the replaced board BADR1 as 
in the case of the first embodiment. In this recov- 
ery process, the initial data in the faulty drive and 
the write data furnished to service the write request 
issued while the board is being detached are re- 
covered inside the alternate drive replacing the 
faulty drive on the board in question. At the same 
time, it is also possible to recover in the normal 
disc drives on the same board the data held by 
these normal drives since before the occurrence of 
the fault as well as the write data generated in 
connection with the write request issued while the 
board was being detached. 

With the first variation of the first embodiment, 
it takes time to recover all data in the disc drives if 
the drives are numerous on the detached board. 



However, one advantage of the first variation is that 
there is no need to retain the new write data in the 
cache memories 4 as has been the case with the 
first embodiment. This means that the cache 
5 memories 4 may be reduced in size. 

(Second variation of the first embodiment) 

Suppose that, as in the case of the first vari- 

io ation of the first embodiment, the CPU1 issues a 
data write request while the board BADR1 carrying 
the faulty drive is being withdrawn from the mother 
board. In that case, the data is divided into a 
plurality of sub data and parity data is generated in 

15 conjunction therewith. Of these sub data, those 
other than the sub data destined to the disc drives 
on the board BADR1 along with the generated 
parity data are written to a plurality of disc drives 
on different boards other than the board BADR1. 

20 The difference between the first and the second 
variation of the first embodiment is that with the 
second variation, if one of the sub data is destined 
to any normal disc drive on the board BADR1 , that 
sub data is retained in the cache memories 4. 

25 Later, when the board BADR1 is replaced onto the 
mother board, the second variation of the first 
embodiment recovers all data to be held in the 
alternate disc drive that replaces the faulty drive on 
the board BADR; the recovery of the data is 

30 achieved by use of the data in the other drives. 
The sub data in the cache memories 4 is written to 
the normal drives on the board BADR1 as in the 
case of the first embodiment. 

The second variation of the first embodiment 

35 requires a smaller quantity of sub data to be held 
in the cache memories 4 than the first embodi- 
ment. When the board BADR1 is replaced onto the 
mother board, the second variation has no need for 
the recovery of data for the normal drives on the 

40 board BADR1 as carried out by the first variation. 

(Third variation of the first embodiment) 

Whereas the first embodiment uses the bus 37 
45 as a high-speed bus for shared use by a plurality 
of drive boards, another alternative is to utilize a 
dedicated bus arrangement that connects each 
drive board to the array controller 2 on a one-to- 
one basis. Such a dedicated bus arrangement al- 
so lows a plurality of disc drives 5 to operate parallelly 
on a concurrent basis. 

(Second embodiment) 

55 The second embodiment is a modification of 

the first embodiment as it is supplemented by a 
spare board 50A shown by broken line in Fig. 1 or 
3. In Fig. 2B, a spare board connector is des- 
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ignated as 500A. The spare board 50A is identical 
in structure to the other drive boards. The spare 
board carries the same number of disc drives as 
each of the other boards. With the second embodi- 
ment, each spare drive corresponds to each of a 
plurality of logical groups constituted by the drives 
on the other multiple boards. If any one drive of a 
given logical group develops a fault, the spare 
drive corresponding to that group replaces the faul- 
ty drive. In the description that follows, any drive 
board other than the spare board will be assigned a 
number BADRi (i = 1 , 2, 3 or 4) as in the first 
embodiment, and the spare board will be given a 
board number BADR5. 

With the spare drives furnished as outlined, the 
logical group table 21 in Fig. 4 for the first embodi- 
ment is replaced by the logical group table 21 in 
Fig. 7 for the second embodiment. The logical 
group table 21 in Fig. 7 includes a data name 
column 26 for holding the names of the data to be 
stored in the spare drives. The address table 31 in 
Fig. 5 for the first embodiment is replaced by the 
address table 31 in Fig. 8 for the second embodi- 
ment. The address table 31 of Fig. 8 differs from 
that of Fig. 5 in that the total number of drives 
constituting each logical group is 5 including one 
spare drive. 

Described below are the workings of the sec- 
ond embodiment in effect when any disc drive 
develops a fault. It is assumed that, as an example, 
the drive SD#1 on the board BADR1 has devel- 
oped a fault. 

(A) System operation related to the faulty drive 

(A1) Where the board carrying the faulty drive is 
still inserted in the mother board 

(A1a) Reading data from the faulty drive 

Suppose that, while the board BADR1 carrying 
the faulty drive is still inserted in the mother board 
37, the CPU 1 has issued a data read request for 
reading data partially including the sub data held in 
the faulty drive. In that case, as with the first 
embodiment, the second embodiment recovers the 
sub data in the faulty drive from the sub data and 
parity data in the other drives belonging to the 
same logical group as that of the faulty drive. In 
recovering the target sub data, the second embodi- 
ment reads the sub data from the drives SD#1 on 
the boards BADR2 and BADR3 as well as the 
parity data from the drive SD#1 on the board 
BADR4. The recovered sub data is combined with 
the other two retrieved sub data for transfer to the 
CPU 1. 

The subsequent workings make the second 
embodiment different from the first. Specifically, 



the recovered sub data is written to the spare drive 
furnished for the logical group in question, i.e., to 
the drive SD#1 on the board BADR5 in the above 
example. Thereafter, the spare drive takes over 

5 from the faulty drive in dealing with that particular 
sub data. That is, if the CPU 1 issues another data 
read request for reading the data designated by the 
preceding data read request, or if the CPU 1 issues 
a data update request regarding the same data, the 

10 read or write operation is carried out on the four 
normal disc drives including the spare drive. 

Needless to say, address SADR1 of the spare 
drive SD#1 on the spare board BADR5 is updated 
appropriately in the address table 31 A so that the 

75 spare drive SD#1 will take over from the faulty 
drive. 

As is clear from the description above, the use 
of spare drives allows the second embodiment to 
dispense with recovering sub data if the data once 

20 accessed is again accessed, whereas sub data 
recovery is required of the first embodiment. And 
unlike the first embodiment, the second embodi- 
ment in updating the data may execute the write 
request without retaining the write data in the 

25 cache memories 4. 

(A1b) Write operation on the data in the faulty drive 

Suppose that, where a board carrying a faulty 

30 drive is mounted on the mother board 37, the CPU 
1 issues a data write request for the data including 
the sub data held in the faulty drive. In that case, 
unlike the first embodiment, the second embodi- 
ment executes the write request immediately. 

35 That is, as in the case of the normal write 

operation, the write data is divided into three sub 
data and parity data is generated in conjunction 
therewith. The difference from the normal write 
operation is that, of these sub data, the sub data to 

40 be written to the faulty drive SD #1 on the board 
BADR1 is written to the drive SD#1 on the spare 
board BADR5. 

Needless to say, the address table 31 A is 
updated appropriately so that the spare drive SD#1 

45 will take over from the faulty drive. 

(A2) System operation in effect when the board 
carrying a faulty drive is detached from the mother 
board 

50 

In this case, the operations for writing and 
reading data to and from the faulty drive are the 
same as the data read operation (Ala) and as the 
data write operation (A1b) with the faulty drive 
55 mounted on the mother board. 
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(A3) System operation in effect when the board 
carrying a faulty drive is replaced onto the mother 
board 

Where the board BADR1 with its faulty drive 
replaced by a normal drive is inserted back into the 
mother board 37, the second embodiment recovers 
the data in the initial faulty drive in the same 
manner as with the first embodiment. The recov- 
ered data is written to the normal disc drive that 
has replaced the faulty drive. During recovery, 
there is no need to use the data held in the spare 
drives. This scheme also makes it possible to 
recover the data written anew from the CPU 1 after 
the fault occurred in the drive. 

It may happen that a faulty drive on the spare 
board BADR5 is replaced by another spare drive. 
In that case, the new spare drive will thereafter 
replace any of the other drives constituting the 
same logical group if the latter drive has developed 
a fault. 

(B) System operation related to the normal drives 
on the board carrying the faulty drive 

(B1) Where the board having the faulty drive is 
mounted on the mother board 

As in the case of the first embodiment, the 
normal drives on the board carrying the faulty drive 
are used as they are continuously. 

(B2) System operation in effect when the board 
carrying the faulty drive is detached from the moth- 
er board 

In this case, the normal drives on the board 
carrying the faulty drive cannot be accessed. This 
makes it necessary to carry out the operation de- 
scribed in (A1) above in connection with the work- 
ings of the faulty drive. 

(B3) System operation in effect when the board 
carrying the faulty drive is replaced onto the moth- 
er board 

Of all the data held in any normal drive on the 
detached board carrying the faulty drive, the initial 
data in that normal drive to which a write request 
has been issued after withdrawal of the board from 
the mother board cannot be used. As mentioned 
earlier, the data generated in response to the write 
request and destined to be written to the normal 
drive is already written in the spare drive cor- 
responding to that normal drive. When the board 
that had the faulty drive is replaced onto the moth- 
er board 37, the effective data held in the spare 
drive corresponding to the normal drive in question 



is moved to the now normal drive on the same 
board. This completes the recovery of the data for 
any normal drive on the detached board having 
carried the faulty drive. 

5 If the CPU 1 issues a read request regarding 

data in a normal drive on the currently detached 
board carrying the faulty drive, the data in that 
normal drive is recovered and placed in the spare 
drive corresponding to that normal drive, as de- 

io scribed. Thus the data recovered in response to 
such a read request is also moved from the spare 
drive to the normal drive. It should be noted, how- 
ever, that the data thus recovered in response to 
the read request is the same as the data held 

75 initially in the normal drive. Thus the above move- 
ment of the data leaves intact the data in the 
normal drive. 

Unlike the first embodiment, the second em- 
bodiment has spare drives that accommodate the 

20 data recovered in response to the read request 
issued by the CPU 1 while the drive board in 
question was being detached from the mother 
board 37. This means that if the CPU 1 later 
requests any of the data in the spare drives, the 

25 data is immediately available. As opposed to the 
first embodiment that holds write data in the cache 
memories 4, the second embodiment accommo- 
dates write data in the spare drives. The second 
embodiment thus can retain much more write data 

30 than the first embodiment. 

(Variations of the second embodiment) 

A number of variations of the second embodi- 
35 ment discussed below may be conceived in con- 
nection with the recovering of data after the board 
that had a faulty drive is replaced onto the mother 
board. 

40 (1) Variations of the second embodiment in con- 
nection with the recovery of data in the faulty drive 

(1a) With the above-described second embodi- 
ment, all data to be retained in the drive having 

45 replaced the faulty drive is recovered by use of 
a plurality of sub data and parity data in the 
other drives within the same logical group to 
which the faulty drive belongs. The data thus 
recovered is stored in the drive that has re- 

50 placed the faulty drive. 

The spare drive that replaces the faulty 
drive already contains part of the data to be held 
in the faulty drive. Thus when the second em- 
bodiment is to recover data destined to the drive 

55 that has replaced the faulty data, the data other 
than that in the spare drive may be recovered 
and stored in the drive having replaced the 
faulty one; the data already retained in the spare 
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drive is moved to the drive that took over from 
the faulty drive. The scheme of this variation 
achieves data recovery more quickly than the 
second embodiment. This scheme is effective 
where the spare drive contains large quantities 
of data. 

(1b) With the second embodiment, when the 
board that had the faulty drive is replaced onto 
the mother board, the data destined to be held 
in the initial faulty drive is restored into the drive 
that has replaced the faulty drive. The spare 
drive . used for the faulty drive is released anew 
as a spare drive. This scheme may be replaced 
by another alternative scheme whereby the data 
to be held in the faulty drive is recovered and 
written to the corresponding spare drive, the 
spare drive being used thereafter as a normal 
drive. In that case, of all data to be retained in 
the initial faulty drive, the data yet to be stored 
in the spare drive may selectively be recovered 
and written to the latter drive. This procedure is 
preferred because it shortens the time required 
for data recovery. 

This alternative scheme is effective where a 
significant portion of the data to be held in the 
initial faulty drive is already retained in the spare 
drive. 

(1c) With the second embodiment, when the 
upper device issues a read request regarding 
data in the faulty drive, the issuance of the 
request causes the data to be recovered and 
stored into the corresponding spare drive. Alter- 
natively, the process of storing the recovered 
data into the spare drive may be omitted. That 
is, the spare drive retains only the write data 
supplied by the upper device while the board 
carrying the faulty drive is being detached from 
the mother board. Obviously, after the board is 
replaced onto the mother board, the data in the 
faulty drive may be recovered and placed in the 
spare drive which in turn may be used in place 
of the faulty drive. 

(1d) Another alternative is that, if any drive de- 
velops a fault, the upper device may imme- 
diately inhibit access to the disc array in ques- 
tion. When all data in the faulty drive is recov- 
ered and stored in the corresponding spare 
drive, the spare drive may be used in place of 
the faulty drive. 

(2) Variations of the second embodiment in con- 
nection with normal drives on the board carrying a 
faulty drive 

(2a) With the second embodiment, after the 
board that had the faulty drive is replaced onto 
the mother board, the data in the spare drives 
corresponding to the normal drives on the same 



board is all moved back to the normal drives. It 
should be noted, however, that the data in the 
spare drives includes data portions recovered in 
response to read requests issued for data in a 
5 normal drive on the board carrying the faulty 

drive while that board was being detached. Such 
data is the same as the data which is already 
retained in the normal drive and which has been 
in effect since before the board was detached. 
w Thus the movement of data from the spare drive 
to the normal drive in the second embodiment 
should preferably exclude the data recovered in 
response to such read requests. This scheme is 
desirable where the movement of data is to be 

75 accomplished more quickly. 

To implement the above scheme requires 
setting to the address table the information in- 
dicating, when data is written to the spare drive, 
whether that data has been written in response 

20 to a read request. In accordance with that in- 
formation, whether or not to move the data from 
the spare drive to the normal drive on the re- 
placed board may be controlled. This scheme is 
effective where there exists more data written in 

25 the spare drive in response to write requests 
from the CPU 1 than the data written thereto in 
response to read requests from the CPU 1 . 
(2b) With the second embodiment, when the 
board that had the faulty drive is replaced onto 

30 the mother board, the data written in the spare 
drive is moved therefrom to the corresponding 
normal board. Alternatively, the data to be held 
in the replaced normal drive may be recovered 
and stored into the spare drive which in turn 

35 may be used thereafter as a normal drive. In 
that case, of all data to be held in the normal 
drive, the data yet to be retained in the spare 
drive is to be recovered selectively and then 
written to the spare drive. This scheme is effec- 

40 tive where a considerable portion of the data to 
be held in the normal drive is already retained in 
the spare drive. 

(Third embodiment) 

45 

Whereas the first and the second embodiment 
are each a disc array system operating on the 
RAID 3 control principles, the third embodiment of 
the invention is a disc array system that adopts the 

50 RAID 5 scheme for its control. Thus although the 
third embodiment has the same system configura- 
tion as that in Fig. 1, the workings of the micropro- 
cessor 17 for control of the system differ from 
those for control of the first embodiment. In de- 

55 scribing the third embodiment below, reference will 
be made to Fig. 1 and other circuit diagrams at- 
tached. The description of the third embodiment 
will emphasize its differences from the first em- 
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bodiment. Fig. 9 is a view of a typical address 
table for use with the third embodiment. 

(System operation in the absence of faulty drives) 

The system operation of the third embodiment 
in the absence of faulty drives is primarily the 
same as that which takes place under the RAID 5 
scheme. How the system works basically will be 
br-efly described below for a better understanding 
of the RAID 5 scheme. 

In the RAID 3 setup, the write data correspond- 
ing to one write request is divided into a plurality of 
tut data Unrr which parity data is generated. The 
m #!!>;. i* M.t lata and parity data thus generated 
ar- *mtt» " irately to different disc drives. In the 
HMl* it vtu: w contrast, the data corresponding 
t«; or*.- request is not divided. A plurality of 

:VUi iu<n- r ^responding to a plurality of write 
tv nit- st* * * .ised to define parity data. The mul- 
t.i u i;tt;i c-.r> - parity data generated in conjunc- 
tion tivi»*witri constitute one parity group. The data 
am -ht pant,- data are written separately to dif- 
lereni d»sc drives. However, it is not necessary that 
ai! data belonging to the same parity group be 
prepared along with their parity data before it is 
written lo a plurality of disc drives. That is, every 
time a write request is issued by the CPU 1, the 
system generates new parity data based on the 
corresponding write data and on the already gen- 
erated parity data (old parity data) regarding the 
parity group to which the write data in question 
belongs The write data and the generated parity 
data are written separately to two disc drives. 
When the CPU 1 issues a read request, the data 
designated by that request is read from the single 
drive where the data is held; the retrieved data is 
sent to the CPU 1. 

More specifically, a plurality of storage loca- 
tions belonging separately to a plurality of disc 
drives are assigned to each parity group. When the 
CPU 1 issues a write request to the array disc 
controller 2, the logical address designated by the 
write request is assigned a parity group and the 
storage location in any one drive allocated to that 
parity group. Where no data is held yet in the other 
storage locations allocated to the parity group in 
question, the remaining data of that parity group is 
regarded as consisting of a predetermined value 
(e.g., 0). Parity data is generated from the write 
data in question. The write data and the parity data 
thus generated are written separately to different 
disc drives. 

When the same parity group is assigned to a 
second write request, the previously generated par- 
ity data (old parity data) regarding the parity group 
is read out. New parity data is generated from the 
old parity data and from the second data des- 
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ignated by the second write request. The second 
write data is written to the storage location as- 
signed to the data, while the new parity data re- 
places the old parity data. The remaining data of 

5 the parity group is thereafter written in like manner. 
Thus the RAID 5 setup differs from the RAID 3 
setup in that every time data is written, new parity 
data is generated from the write data in question 
and from the old parity data of the parity group to 

io which the data belongs. What is noticeable here is 
that the other data of the same group is not used in 
generating the new parity data. When currently 
retained data is read from any one disc drive, the 
other data of the same parity group to which the 

15 written data belongs is not read out. 

(System operation upon disc drive fault) 

The system operation of the third embodiment 

20 in case of a disc drive fault is primarily the same 
as that of the first embodiment with the exception 
of differences stemming from what sets the RAID 5 
scheme apart from the RAID 3 scheme. Illustra- 
tively, suppose that where a board having a faulty 

25 drive is mounted on the mother board, the system 
executes a write request from the CPU 1 for writing 
data to a normal drive. In this case, as in the case 
where no faulty drive exists, there is no need to 
divide the write data. Suppose also that while the 

30 board carrying the faulty drive is being detached 
from the mother board, the host device issues a 
write request to that faulty drive. In this case, the 
write data corresponding to the write request is 
accommodated temporarily in the cache memories 

35 4. 

When the board having the faulty drive re- 
placed by a normal alternate drive is inserted back 
into the mother board, the data to be held in the 
alternate drive is recovered and written thereto. 

40 Thereafter, the write data retained in the cache 
memories 4 is written without being divided to the 
alternate drive. The same holds for the normal 
drives on the board having the faulty drive, except 
that there is no need to recover the data for these 

45 normal drives. 

(Variations of the third embodiment) 

The variations of the first embodiment may 
50 also be applied to the third embodiment. However, 
there need to be additional modifications attrib- 
utable to the differences between the RAID 3 and 
the RAID 5 scheme. 

For example, suppose that with the first vari- 
55 ation of the first embodiment, the CPU1 issues a 
write request to a faulty drive while the board 
having that faulty drive is being withdrawn from the 
mother board. In this case, as described, the write 
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data is divided into a plurality of sub data and 
parity data is generated in conjunction therewith. Of 
these sub data, the sub data other than those to be 
written to the faulty drive as well as the newly 
generated parity data are written separately to a 
plurality of disc drives. A third embodiment vari- 
ation corresponding to this first variation of the first 
embodiment has no need to divide the write data. 
Suppose that the upper device supplies write data 
while the board having the faulty drive is being 
detached from the mother board. In this case, of 
the multiple data of the parity group to which the 
write data belongs, the data held in the drives other 
than the faulty drive as well as the parity data 
applicable to that parity group are read out. New 
parity data is generated from the data thus read out 
and from the write data supplied anew from the 
CPU 1 . Then the newly generated parity data need 
only replace the old parity data. 

The third embodiment may also be applied to 
RAID 4 array controllers. 

(Fourth embodiment) 

The fourth embodiment of this invention is the 
disc array system of the second embodiment 
adapted to operate on the RAID 5 control princi- 
ples. Thus the fourth embodiment, as with the 
second embodiment, is implemented using the 
same system constitution as that in Fig. 1 sup- 
plemented by the spare drive board 50A. What 
makes the fourth embodiment different from the 
second embodiment is the manner in which the 
system is controlled by the microprocessor 17. In 
describing the fourth embodiment below, reference 
will be made to Fig. 1 and other circuit diagrams 
attached. The description of the fourth embodiment 
will emphasize its differences from the second and 
the third embodiment. 

(System operation in the absence of faulty drives) 

The system operation of the fourth embodi- 
ment in the absence of faulty drives is primarily the 
same as that of the second embodiment with the 
exception of differences stemming from what sets 
the RAID 5 scheme apart from the RAID 3 scheme. 
These differences have already been described in 
connection with what makes the third embodiment 
different from the first embodiment. 

(System operation upon disc drive fault) 

The system operation of the fourth embodi- 
ment in case of a disc drive fault is primarily the 
same as that of the second embodiment with the 
exception of differences stemming from what sets 
the RAID 5 scheme apart from the RAID 3 scheme. 
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These differences have already been described in 
connection with what makes the third embodiment 
different from the first embodiment. 

5 (Variations of the fourth embodiment) 

The variations of the second embodiment may 
also be applied to the fourth embodiment. How- 
ever, there need to be additional modifications at- 
10 tributable to the differences between the RAID 3 
and the RAID 5 scheme. 

For example, where the CPU1 issues a write 
request to a faulty drive and the request is serviced 
using a spare drive, there is no need to divide the 

75 write data as in the case where no faulty drives 
exist. That is, of the multiple data of the parity 
group to which the write data belongs, the data 
held in the drives other than the faulty drive as well 
as the parity data applicable to that parity group 

20 are read out. New parity data is generated from the 
data thus read out and from the write data supplied 
anew from the CPU 1 . The write data is written to 
the spare drive, and the newly generated parity 
data replaces the old parity data. 

25 The fourth embodiment may also be applied to 

RAID 4 array controllers. 

(Fifth embodiment) 

30 Unlike the embodiments described above, the 

fifth embodiment is one which applies where a 
board connector has developed a fault while all 
drives remain normal. 

In case of a poor contact or other contact- 

35 related failure between the mother board and the 
connector of any drive board, all disc drives on that 
board become inaccessible. In that case, it is nec- 
essary to withdraw the board in question from the 
mother board so that the board will be replaced by 

40 a normal board. The fifth embodiment is in fact any 
of the preceding embodiments applied in that 
eventuality. 

When the user detects a faulty connector of 
any board, the user gets the CPU 1 to issue a 

45 command notifying the array controller 2 of the 
impending withdrawal of the board. The user then 
withdraws the board in question from the mother 
board. Thereafter, the system operation is the 
same as that of any of the preceding embodiments 

50 which may be in use. 

A poor connector contact of any drive board 
may be detected as follows: when any one drive on 
a given drive board currently mounted on the 
mother board is accessed unsuccessfully, the array 

55 controller 2 sets in the address table a flag indicat- 
ing a fault of that drive. If other drives on the same 
board are found consecutively to be inaccessible, 
there is a high possibility of a fault other than drive- 
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related faults in the individual drives. This should 
prompt the user to investigate what went wrong. 
When the user detects a poor connector contact of 
the board, the user inputs the command mentioned 
above. If there have been faulty drives detected 
prior to the check on faulty connector contact, the 
user enters commands for turning off the fault bits 
corresponding to such drives. In response to these 
commands, the array controller 2 resets the fault 
bits corresponding to the faulty drives on the des- 
ignated board. 

As described, the disc array system according 
to the invention operates continuously even as the 
board having faulty drives is being detached from 
the mother board in order for such drives to be 
replaced by normal alternate drives. 

In case of a connector fault of any drive board, 
it is possible to execute data read and write re- 
quests designating data in any drive on the board 
in question. 

As many apparently different embodiments of 
this invention may be made without departing from 
the spirit and scope thereof, it is to be understood 
that the invention is not limited to the specific 
embodiments thereof except as defined in the ap- 
pended clams. 

Claims 

1. A disk array system, comprising: 

(a) a group of disc storage devices divided 
into a plurality of logical groups, each logi- 
cal group comprising a plurality of disc stor- 
age devices for holding groups of error cor- 
rection data, each error correction data 
group comprising a plurality of data and 
error correction code for the plurality of 
data; 

(b) a plurality of local boards each for hold- 
ing a number of disc storage devices within 
the group of disc storage devices 

(c) an array controller connected to the 
group of disc storage devices in such a 
manner that each local board can be dis- 
connected from the array controller and can 
be connected to the array controller again 
thereafter, said array controller executing 
read requests and write requests provided 
by an upper device; 

(d) each write request being executed in 
such a manner that (c1 1) an error correction 
code is generated which should belong to 
an error correction data group to which write 
data attached to said each write request 
belongs, and that (c12) the write data and 
the generated error correction code are writ- 
ten into plural disc storage devices within 
the group of disc storage devices as data 



belonging to one error correction data 
group; 

(d2) each read request being executed in 
such a manner that (c21 ) data requested by 
5 said each read request is read from at least 

one of the group of disc storage devices in 
case the one disc storage device is not 
faulty and that (c22) the requested data is 
recovered in case the one disc storage de- 
70 vice is faulty, based upon one error correc- 

tion code and plural other data which be- 
long to one error correction data group to 
which said requested data belongs, and are 
held in other disc storage devices belonging 
15 to one logical group to which said one faulty 

disc storage device belongs; 
wherein said group of disc storage devices 
are held on said plurality of local boards in 
such a manner that disc storage devices be- 
20 longing to a same logical group are held on 

mutually different ones of said plurality of local 
boards. 

2. A disc array device according to claim 1, fur- 
25 ther comprising: a common board for holding 

the plurality of local boards in such a manner 
that each local board can be withdrawn from 
the common board and can be inserted again 
into the common board thereafter; 

30 wherein said plurality of local boards is 

held by the common board in substantially 
perpendicular thereto and substantially in par- 
allel to each other local board; 

wherein said array controller is connected 

35 to the group of disc devices by way of signal 

lines provided on the common board. 

3. A disc array device according to claim 1, 
wherein plural disc storage devices mounted 

40 on a local board each belong to mutually dif- 

ferent logical groups. 

4. A disc array device according to claim 2, 
wherein plural disc storage devices mounted 

45 on a local board each belong to mutually dif- 

ferent logical groups. 

5. A disc array device according to claim 2: 

wherein said array controller accepts one 
50 read request which is supplied by said upper 

device after withdrawal of one of said local 
boards from said common board and which 
requests readout of one data held on one 
faulty disc storage device held on said one 
55 local board; 

wherein said array controller executes said 
accepted one read request, in such a manner 
that said one data is recovered based upon 
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plural other data held In other disc storage 
devices held on local boards other than said 
one local board, and that said recovered one 
data is supplied to said upper device; 

wherein said array controller accepts an- 5 
other read request which is supplied by said 
upper device after said withdrawal and which 
requests readout of another data held in an- 
other normal disc storage device held on said 
one local board; 10 

wherein said array controller executes said 
accepted another request, in such a manner 
tna- said another data is recovered based upon 
piur a • further other data held in further other 
1.1: s- «a>. devices held on local boards oth- 75 
. < nor- iOM-i one local board, and that said 

. v *-f.«^ another data is supplied to said up- 

6 A ci.<, arra> device according to claim 5, fur- 20 
rrw- r f »mr»Msing a random access memory; 

wh..?»-.,n said array controller further ac- 
cept* wnit: requests each of which is supplied 
attor withdrawal of a local board from said 
common board and requires write data to be 25 
wntter into either one of said faulty disc stor- 
age device and said another normal disc stor- 
age device held on said one local board; 

wherein said array controller stores write 
data requested by each of said accepted write 30 
requests; 

wherein said array controller executes said 
write requests by using plural write data re- 
quested thereby and held in said memory, 
after said one faulty disc storage device has 35 
been replaced by a normal replacing disc stor- 
age device and the one local board is inserted 
to said common board again. 

7. A disc array device according to claim 6, 40 
wherein said array controller recovers plural 
data which were held in said one faulty disc 
storage device, and writes said recovered plu- 
ral data into said replacing disc storage device, 
after insertion of said one local board and 45 
before execution of said write requests ac- 
cepted after disconnection of said one local 
board from said common board; 

wherein recovering of said plural data is 
executed based upon plural other data held in 50 
disc storage devices held on local boards oth- 
er than said one local board; 

wherein said other disc storage devices 
are different from said one faulty disc storage 
device but belong to the same logical group; 55 

wherein said plural other data comprise 
plural error correction codes belonging to error 
correction data groups to which said plural 



data held in said one faulty disc storage device 
belong and plural data other than said plural 
data held in said one faulty disc storage device 
and said plural error correction codes among 
plural data belonging to those plural correction 
data groups. 

8. A disc array device according to claim 5, 

wherein said array controller further ac- 
cepts write requests each of which is supplied 
after said said withdrawal of said one local 
board and requires write data to be written into 
either one of said faulty disc storage device 
and another normal disc storage device held 
on said one local board; 

wherein said array controller executes 
each of said accepted write requests before 
said one faulty disc storage device is replaced 
by a normal replacing disc storage device and 
said one withdrawn local board is inserted to 
said common board again, in such a manner 
that an error correction code is generated for 
write data attached to said each write request, 
and is written into other one disc storage de- 
vice held on one of said plurality of local board 
other than said on local board; 

wherein said other one disc storage device 
is one belonging to an error correction data 
group to which one of said faulty disc storage 
device and said another normal disc storage 
device into which said each write data is to be 
written belongs; 

wherein said array controller recovers plu- 
ral data to be held in each of said replacing 
disc storage device and said another normal 
disc storage devices based upon plural other 
data held in other disc storage devices held on 
other local boards other than said one local 
board, and writes said plural data recovered for 
each of said replacing disc storage device and 
said another normal disc storage device there- 
in; 

9. A disc array device according to claim 8, 
wherein recovering of plural data for said an- 
other normal disc storage device is executed 
in such a manner that plural data supplied by 
said upper device to be written into said an- 
other normal disc storage device after said 
withdrawal of said one local board are recov- 
ered but plural data held by said another nor- 
mal disc storage device before said withdrawal 
are not recovered. 

10. A disc array device according to claim 5, fur- 
ther comprising at least one spare disc storage 
device connected to said common board; 

wherein said array controller accepts write 
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requests each of which is supplied by said 
upper device after said withdrawal of said one 
local board and requires write data to be writ- 
ten into one of said one faulty disc storage 
device and another normal disc storage de- 5 
vices held on said one local disc storage de- 
vice; 

wherein said array controller executes 
each accepted write request in such a manner 
that write data required by each accepted write io 
request is written into said spare disc storage 
device; 

wherein said array controller executes a 
read request supplied by said upper device 
after withdrawal of said one local board, so that 75 
data required by the read request is read out 
from said spare disc storage device, in case 
said required data is held in said spare disc 
storage device. 

20 

11. A disc array device according to claim 10, 

wherein said array controller writes data 
recovered as a result of execution of a preced- 
ing read request supplied by said upper device 
after said withdrawal of said one local board, 25 
into said spare disc storage device, in case 
said preceding read request requests data held 
in one of said one faulty disc storage device 
and said another normal disc storage devices 
held on said one withdrawn board; 30 

wherein said array controller reads said 
recovered data written into said spare disc 
storage device, in case a subsequent read 
request supplied by said upper device requires 
read out of same data as the data required by 35 
said preceding read request. 

12. A disc array device according to claim 10, 

wherein said array controller transfers part 
of plural data held in said spare disc storage 40 
device into said another norma! disc storage 
devices held on said one local board, after said . 
one faulty disc storage device is replaced by a 
normal replacing disc storage device and said 
one local board is inserted to said common 45 
board again, said part of plural data including 
plural data which were required to be written 
into said another normal disc storage device 
by write requests supplied by said upper de- 
vice after withdrawal of said one local board. 50 

13. A disc array device according to claim 12, 

wherein, after replacing of said one faulty 
disc storage device and said insertion of said 
one withdrawn local board, said array controller 55 
recovers plural data to be held in said replac- 
ing disc storage device, based on plural data 
held in plural disc storage devices held on 



plural local boards other than said one local 
board, and writes said recovered plural data 
into said replacing disc storage device, said 
plural data recovered including plural data held 
by said faulty disc storage device before said 
faulty disc storage device has become faulty 
and plural data supplied by said upper device 
as data to be written into said faulty disc 
storage device after said withdrawal of said 
one local board. 

14. A disc array device according to claim 5, fur- 
ther comprising a plurality of spare disc stor- 
age devices; 

wherein said array controller selects one of 
said plurality of spare disc storage devices for 
each of said faulty disc storage devices and 
said normal disc storage devices held on said 
one board, at withdrawal of said one local 
board; 

wherein said array controller accepts a 
write request which is supplied by said upper 
device after withdrawal of said one local board 
and which requires write data to be rewritten 
into one of said faulty disc storage device and 
said another normal disc storage devices; 

wherein said array controller executes said 
accepted write request in such a manner that 
said write data is written into one of said plural- 
ity of said spare disc storage devices selected 
for said one disc storage device; 

wherein said array controller accepts a 
read request subsequently supplied by said 
upper device, so that data requested by said 
read request is read out from one of said 
plurality of spare disc storage devices, in case 
said requested data is already held therein. 

15. A disc array device according to claim 14, 
further comprising at least one spare local 
board held on said common board so that said 
spare board can be withdrawn from said com- 
mon board and can be inserted into said com- 
mon board again; 

wherein said plurality of spare disc storage 
devices are held on said spare board, and is 
connected to said array controller by way of 
the signal line provided on said common 
board. 

16. A disc array device according to claim 14, 

wherein said array controller writes data 
recovered as a result of execution of a preced- 
ing read request supplied by said upper device 
after said withdrawal of said one local board, 
into one of said plurality of spare disc storage 
devices selected for said one of said one faulty 
disc storage device and said another normal 
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disc storage device, in case said preceding 
read request requests data held therein; 

wherein said array controller reads said 
recovered data written into said one spare disc 
storage device, in case a subsequent read 5 
request supplied by said upper device requires 
read out of same data as the data required by 
said preceding read request. 

17. A disc array device according to claim 14, w 

wherein said array controller transfers part 
of plural data held in one of said plurality of 
spare disc storage devices selected for said 
another normal disc storage device held on 
said one local board into said another normal 15 
disc storage devices, after said one faulty disc 
storage device is replaced by a normal replac- 
ing disc storage device and said one local 
board is inserted to said common board again, 
said part of plural data including plural data 20 
which were required to be written into said 
another normal disc storage device by write 
requests supplied by said upper device after 
withdrawal of said one local board. 

25 

18. A disc array device according to claim 14, 

wherein, after replacing of said one faulty 
disc storage device and said insertion of said 
one withdrawn local board, said array controller 
recovers plural data to be held in said replac- 30 
ing disc storage device, based on plural data 
held in plural disc storage devices held on 
plural local boards other than said one local 
board, and writes said recovered plural data 
into said replacing disc storage device, said 35 
plural data recovered including plural data held 
by said faulty disc storage device before said 
faulty disc storage device has become faulty 
and plural data supplied by said upper device 
as data to be written into said faulty disc 40 
storage device after said withdrawal of said 
one local board. 

19. A disc array device according to claim 14, 

wherein, after said replacing of said one 45 
faulty disc storage device and said inserting of 
said one withdrawn local board, said array con- 
troller recovers plural data which was held by 
said one faulty disc storage device before said 
one board has been withdrawn from said com- 50 
mon board, and writes said recovered plural 
data into said replacing disc storage device; 

wherein said array controller further trans- 
fers plural data supplied by said upper device 
after said one faulty disc storage device has 55 
become faulty from one of said plurality of 
spare disc storage devices selected for said 
one faulty disc storage device, to said replac- 



ing disc storage device. 

wherein said recovering is executed based 
on plural data held in plural disc storage de- 
vices held on plural local boards other than 
said one local board. 

20. A disc array device according to claim 17, 

wherein, after said one faulty drive has 
become faulty, said array controller recovers 
plural data which was held by said one faulty 
disc storage device before said one board has 
become faulty, and transfers said recovered 
plural data to one of said plurality of spare disc 
storage devices selected for said one faulty 
disc storage device; 

wherein said array controller executes read 
requests and write requests supplied by said 
upper device after said one faulty disc storage 
device has become faulty so that the read 
requests and the write requests are executed 
by using said one spare disc storage device 
selected for said one faulty disc storage de- 
vice, in case said read requests and said write 
requests designates data held in said one faul- 
ty disc storage device. 

21. A disc array device according to claim 20, 

wherein, after said replacing of said one 
faulty disc storage device and said insertion of 
said one local board, said array controller 
keeps using said one spare disc storage de- 
vice selected for said one faulty disc storage 
device, in place of said one faulty disc storage 
device, and uses said replacing disc storage 
device as one of said plurality of spare disc 
storage devices. 

22. A disc array device according to claim 21, 
further comprising at least one spare local 
board held on said common board so that said 
spare board can be withdrawn from said com- 
mon board and can be inserted into said com- 
mon board again; 

wherein said plurality of spare disc storage 
devices are held on said spare board, and is 
connected to said array controller by way of 
the signal line provided on said common 
board. 

2a A disc array device according to claim 17, 

wherein, after withdrawal of said one local 
board, said array controller recovers plural data 
which was held by said one faulty disc storage 
device before said one board has become 
faulty, and writes said recovered plural data 
into one of said plurality of spare disc storage 
devices selected for said one faulty disc stor- 
age device; 
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wherein said array controller executes read 
requests and write requests supplied by said 
upper device after said one faulty disc storage 
device has become faulty, so that the read 
requests and the write requests are executed 5 
by using said one spare disc storage device 
selected for said one faulty disc storage de- 
vice, in case said read requests and said write 
requests designates data held in said one faul- 
ty disc storage device. io 

24. A disc array device according to claim 20, 

wherein, after replacing of said one faulty 
disc storage device and said inserting of said 
one local board, said array controller keeps 15 
using said one spare disc storage device se- 
lected for said one faulty disc storage device 
in place of said one faulty disc storage device, 
and uses said replacing disc storage device as 
one of said plurality of spare disc storage 20 
devices. 

25. A disc array device according to claim 24, 
further comprising at least one spare local 
board held on said common board so that said 25 
spare board can be withdrawn from said com- 
mon board and can be inserted into said com- 
mon board again; 

wherein said plurality of spare disc storage 
devices are held on said spare board, and is 30 
connected to said array controller by way of 
the signal line provided on said common 
board. 

26. A disc array device according to claim 1, 35 

wherein said array controller executes 
each write request supplied by said upper de- 
vice, in such a manner that write data required 
by said each write request is divided into a 
plurality of sub-data, generates an error cor- 40 
recting code for said plurality of sub-data, and 
write said plurality of sub-data and said gen- 
erated error correction code into plural disc 
storage devices held on mutually different local 
boards, as data belonging to one error correc- 45 
tion data group; 

wherein said array controller executes 
each read request supplied by said upper de- 
vice, in such a manner that a plurality of sub- 
data which form data requested by said read 50 
request are read from plural disc storage de- 
vices held on mutually different local boards, 
said plurality of sub-data combined into one 
data, and said one data is supplied to said 
upper device. 55 

27. A disc array device according to claim 1 , 

wherein an error correction data group 



comprises a plurality of write data required by 
a plurality of write requests supplied by said 
upper device and an error correction code for 
said plurality of write data; 

wherein said array controller executes 
each write request supplied by said upper de- 
vice, in such a manner that said old write data 
to be updated by write data attached to said 
each write request is read and an old error 
correction code belonging to an error correc- 
tion data group to which said write data be- 
longs, a new error correcting code for said 
error correction data group is generated by 
said write data, said old error correcting code 
and said new error correcting code, and said 
write data and said generated new error cor- 
rection code are written into plural disc storage 
devices held on mutually different local boards, 
as data belonging to said error correction data 
group; 

wherein said array controller executes 
each read request supplied by said upper de- 
vice, in such a manner that data requested by 
said read request is read from one of disc 
storage devices held on one of said plurality of 
local boards, and is supplied to said upper 
device. 

28. A disc array device according to claim 1: 

wherein said array controller accepts one 
read request which is supplied by said upper 
device after withdrawal of one of said local 
boards from said common board and which 
requests readout of one data held on one 
faulty disc storage device held on said one 
local board; 

wherein said array controller executes said 
accepted one read request, in such a manner 
that said one data is recovered based upon 
plural other data held in other disc storage 
devices held on local boards other than said 
one local board, and that said recovered one 
data is supplied to said upper device; 

wherein said array controller accepts an- 
other read request which is supplied by said 
upper device after said withdrawal and which 
requests readout of another data held in an- 
other normal disc storage device held on said 
one local board; 

wherein said array controller executes said 
accepted another request, in such a manner 
that said another data is recovered based upon 
plural further other data held in further other 
disc storage devices held on local boards oth- 
er than said one local board, and that said 
recovered another data is supplied to said up- 
per device. 
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29. A disc array device according to claim 28, 
further comprising a random access memory; 

wherein said array controller further ac- 
cepts write requests each of which is supplied 
after said said withdrawn and requires write 5 
data to be written into either one of said faulty 
disc storage device and said another normal 
disc storage device held on said one local 
board; 

wherein said array controller stores write 10 
data requested by each of said accepted write 
requests; 

wherein said array controller executes said 
write requests by using plural write data re- 
quested thereby and held in said memory, 15 
after said one faulty disc storage device has 
been replaced by a normal replacing disc stor- 
age device and the one local board is inserted 
to said common board again. 

20 

30. A disc array device according to claim 28, 
further comprising a plurality of spare disc 
storage devices; 

wherein said array controller selects one of 
said plurality of spare disc storage devices for 25 
each of said faulty disc storage devices and 
said normal disc storage devices held on said 
one board, at withdrawal of said one local 
board; 

wherein said array controller accepts a 30 
write request which is supplied by said upper 
device after withdrawal of said one local board 
and which requires write data to be rewritten 
into one of said faulty disc storage device and 
said another normal disc storage devices; 35 

wherein said array controller executes said 
accepted write request in such a manner that 
said write data is written into one of said plural- 
ity of said spare disc storage devices selected 
for said one disc storage device; 40 

wherein said array controller accepts a 
read request subsequently supplied by said 
upper device, so that data requested by said 
read request is read out from one of said 
plurality of spare disc storage devices, in case 45 
said requested data is already held therein. 

31. In a disk array device, having: 

a group of disc storage devices divided 
into a plurality of logical groups, each logical 
group comprising a plurality of disc storage 
devices for holding error correction data 
groups, each error correction data group com- 
prising a plurality of data and error correction 
data for the plurality of data; 

a plurality of local boards each for holding 
a number of disc storage devices within the 
group of disc storage devices; and 



an array controller connected to the group 
of disc storage devices in such a manner that 
each local board can be disconnected from the 
array controller and can be connected to the 
array controller again thereafter, said array 
controller executing read requests and write 
requests provided by an upper device, a disc 
array access method, comprising the steps of; 

(a) determining plural disc storage device 
which should belong each logical group, in 
such a manner that said plural disc storage 
device are held on mutually different ones 
of said plurality of local boards; 

(b) disconnecting one of said plurality of 
local boards which holds one faulty disc 
storage device, from said array controller at 
occurrence of a fault to said one disc stor- 
age device; 

(c) keeping accepting read requests each of 
which is supplied by an upper device after 
the disconnecting and requires readout of 
data held in one of said one faulty disc 
storage device and another normal storage 
device held on said one local board; and 

(d) executing each read request, in such a 
manner that read data required by said 
each read request is recovered, based upon 
plural data held on plural disc storage de- 
vices held on plural local boards other than 
said one disconnected board, and that said 
recovered data is supplied to said upper 
device. 

32. A disc array access method according to claim 
31, 

wherein said disc array device further 
comprises a random access memory; 

wherein said method further comprises the 
steps of: 

(e) keeping accepting write requests each of 
which is supplied by an upper device after 
the disconnecting of said local board and 
requires write data to be written into one of 
said faulty disc storage device and said 
another normal disc storage devices held on 
said one local board; 

(f) temporally holding plural write data re- 
quired by said accepted write requests in 
said memory; and 



50 (g) executing said plural write requests by 

using said plural write data held in said 
memory, after said one faulty disc storage 
device is replaced by a replacing normal 
disc storage device and the one local board 
55 is connected to said array controller again. 

33. A disc array access method according to claim 
31, 
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wherein said disc array device further 
comprises at least one spare disc storage de- 
vice connected to said array controller; 

wherein said method further comprises the 
steps of: 

(e) keeping receiving write requests each of 
which is supplied by an upper device after 
the disconnecting of said local board and 
requires write data to be written into one of 
said faulty disc storage device and said 
another norma! disc storage devices held on 
said one local board; 

(f) executing each accepted write request in 
such a manner that write data requested by 
said each write request is written into said 
spare disc storage device; and 

(g) after said one faulty disc storage device 
has been replaced by a normal replacing 
disc storage device and the one local board 
is connected to said array controller again, 
transferring at least part of plural write data 
held in said spare disc storage device to 
said another normal disc storage device 
held on the one local board, said part of 
said plural write data being ones given by 
said upper device as ones to be written into 
said another normal disc storage devices. 

34. A disc array access method according to claim 

31, 

wherein said disc array device further 
comprises a plurality of spare disc storage 
devices connected to said array controller; 

wherein said method further comprises the 
steps of: 

(e) selecting one of said plurality of spare 
disc storage devices for each of said one 
faulty disc storage device and said another 
normal disc storage device held on said one 
local board at disconnecting of said one 
local board; 

(f) keeping receiving write requests each of 
which is supplied by an upper device after 
the disconnecting and requires write data to 
be written into one of said faulty disc stor- 
age device and said another normal disc 
storage devices held on said one local 
board; 

(g) recovering at said disconnecting of said 
one local board, plural data which were al- 
ready held in said one faulty disc storage 
device before said one faulty disc storage 
device has become faulty, based upon plu- 
ral data held on disc storage devices held 
on plural local boards other than said one 
local board; 

(h) writing said recovered plural data into 
one of said plurality of spare disc storage 



devices selected for said one faulty disc 
storage device; 

(i) executing said accepted write requests in 
such a manner that write data required by 

5 each of said accepted write request is writ- 

ten into one of said spare disc storage 
devices selected for one of said one faulty 
disc storage device and said another normal 
disc storage device into which said required 

io write data is to be written; 

(j) keeping using said one spare disc stor- 
age device selected for said one faulty disc 
storage device in place of said one faulty 
disc storage device after said one faulty 

75 disc storage device has been replaced by a 

normal replacing disc storage device and 
the one local board is connected to said 
array controller again; and 
(k) using said replacing disc storage device 

20 as a new spare disc storage device. 

35. A disc array access method according to claim 
34, further comprising the steps of: 

(I) after said connecting again, transferring 

25 plural write data held in another one of said 

plurality of spare disc storage devices se- 
lected for said another normal disc storage 
device to said another normal disc storage 
device held on the one local board; and 

30 (m) releasing said another spare device as 

a new spare device. 
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